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(54) TItte: LABELING GRAPHICAL FEATURES OF DRAWINGS 
(57) Abstract 

A method is provided for automatic placement of labels for a given graph layout or 
1. Even though in practice a label is usually associated with a line (edge), point (node) or 
area, this method can be extended to produce labeling solution for any graphical feature with 
explicit geometric representation (in two or three dimensions). This method first finds a labeling 
solution for a set of graphical features G by eliminating a subset of the set of potential label 
placements for any member G, and reducing the labeling problem to a maximum matching 
problem of a bipartite graph. Next, if there are graphical features in G that have no label 
placement assigned to them yet, a backtracking algorithm may be used to improve the space 
available for the labeled graphical features. It may be shown that the labeling problem is 
NP-hard if any graphical feature in G is a line or point. As a result, the GFLP problem cannot 
be solved in polynomial tune, but requires the application of well-devised heuristics. 
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LABELING GRAPHICAL FEATURES OF DRAWINGS 
BACKGROUND OF THE INVENTION 
1. Held of the Invention 

This invention relates to systems for the automated placement of labels 
5 in relation to graphical features of two- or three-dunensional drawings, either 
drawings produced by layout tools or drawings that in a scaled version depict 
"real" drawmgs (i.e., geographical maps and technical maps). More precisely, 
this invention relates to automatically producing a labeling solution for a given 
graph layout or map. 

10 2. State of the Art 

Graph drawing is a general area of computer science or mathematics that 
attempts to develop algorithms which may be used to produce aesthetically 
pleasing "drawings" of graphs from abstract topological information about the 
nodes in a graph and the edges that connect them together. A related discipline 

15 is that of graph labeling. The need for automated placement of labels in 

relation to graphical features first emerged from placing labels in relation to 
features of technical, and particularly geographical, maps. The furst attempts to 
automate the labeling process are ahnost twenty years old. 

In recent years, a growing recognition of a large number of applications 

20 of graph drawing have caused it to receive increased attention. Graph drawings 
find application in many different areas including, among others, such areas as 
database design (e.g. entity relationship diagrams), software engmeering, 
CASE, software debugging, and communication networking. The labelmg of 
the graphical features of a drawing in most of these applications is essential, 

25 since it gives important information about the relations represented by the 
drawing. 

As used in the present specification, the term "label" is used to refer to 
text or other indicia to be placed on a drawing. The term "label placement" is 
used to refer to one or more selected areas within which a label may be placed. 
30 The term "label assignment" is used to refer to a label placement that has been 



- 1 - 



wo 98/20453 



PCT/US97/19167 



chosen. Finally, the term "iabeling solution" is used to refer to a set of label 
assignments for multiple labels such that the label placements satisfy specified 
criteria. 

Most of the past research on the labeling problem has been done for 
5 labeling geographic and technical maps. Most prior-art techniques devised to 
solve the Graph Feature Label Placement (GFLP) problem are trivial in nature. 
Previous approaches have used greedy algorithms, exhaustive-search 
(backtracking) algorithms, algorithms that simulate physical models (i.e., 
Simulated Annealing), and reduction of the labeling problem to a variant of 0-1 
10 integer programming. These techniques either do not produce the desired 
results, due to theu: tendency to get trapped in local optima, or they take 
exponential time. 

Almost all of the algorithms devised to solve the GFLP problem (or a 
restricted version of the GFLP problem) have been focused more on the 

15 problem of labeling points and less on the problem of labeling lines. 

In geographical maps, the labeling of lines (the Edge Label Placement, 
or ELP problem) is less restrictive than the labeling of points (Node Label 
Placement, or NLP problem), due to the fact that lines are usually long, and 
thus offer many possible positions for placing labels. The edge density of any 

20 drawing can vary. Hence, edge labeling algorithms, to be of practical 

importance, must perform well even for drawings with very high edge density. 
Since the ELP problem is NP-hard, any effective approach to solve the ELP 
problem must be durected toward devising heuristics for that problem. 

SUMMARY OF THE INVENTION 
25 The present mvention, generally speaking, provides a method for assigning 

label placements to features of a graph drawing, using various heuristics. This 
method can be extended to solve the line labeling problem (or edge labeling 
problem for graph layouts) in a more sophisticated and more effective way than 
previous methods. The advantage of this method is that it finds a labeling 
30 assignment, at least in the first phase of the algorithm, that takes into accoimt 
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the drawing as a whole. In other words, it tries to find a solution using 
"global" information about the drawing. This dramatically differs from other 
approaches to solving the GFLP problem, which either start from random initial 
label placements or placements that disregard the overlaps between them. In 
5 accordance with one aspect of the present method, it provides sufficient 
conditioiis that guarantee an optimal labeling solution. In accordance with 
another aspect of the present method, it provides heuristics that explore the 
labeling space by trymg to construct the aforementioned sufficient conditions in 
order to obtain a good labeling solution. 

10 BRIEF DESCRIPTION OF THE DRAWING 

The present invention may be further understood from the following 
description in conjimction with the appended drawing. In the drawing: 

Figure 1(a) is a diagram of potential edge label placements in relation to 
an edge (1, 2) of a graph drawing; 
15 Figure 1(b) is a diagram of a contmuous edge label placement space in 

relation to the edge (1,2); 

Figxure 2 is a perspective view of a computer system with which the 
present invention may be used; 

Figure 3 is a block diagram of the computer system of Figure 2; 
20 Figure 4 is an example of a matching graph; 

Figiure 5 is an overall flowchart of a labeling method according to the 
present invention; 

Figure 6(a) is a diagram of a graph drawmg having edge crossings; 
Figure 6(b) is a diagram of the graph drawing of Figure 6(a) with edge 
25 crossings replaced by virtual nodes; 

Figure 6(c) is a diagram of the graph drawing of Figure 6(b) with 
clusters identified by different shadings; 

Figure 7(a) is a diagram of a cluster of a drawing in which labeling 
spaces for each graphical feature (e.g., edge) have been partitioned into disjoint 
30 regions having no overlaps, two overlaps, or more than two overlaps; 
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Figure 7(b) is a diagram similar to tliat of Figure 7(a) but in which 
potential label placements have been restricted to be parallel to the horizontal 
direction; 

Figure 8(a) is a diagram corresponding to Figure 7(b) in which potential 
5 label placements are shown; 

Figure 8(b) is a diagram of a portion of an overlap graph corresponding 
to the diagram of Figure 8(a); 

Figure 9(a) is a diagram of a cluster divided into three sets of strip 
indicated by different styles of lines, each strip accommodating a label 
10 placement for a corresponding edge; 

Figure 9(b) is a diagram corresponding to Figure 9(a) in which potential 
label placements are shown; 

Figure 9(c) is a diagram of a portion of an overlap graph corresponding 
to the diagram of Figure 9(b); 
15 Figure 10(a) is a diagram of a first style of label that may be used when 

a label cannot be placed without overlap; 

Figure 10(b) is a diagram of a second style of label that may be used 
when a label cannot be placed without overlap; 

Figure 10(c) is a diagram of a third style of label that may be used when 
20 a label cannot be placed without overlap; 

Figure 11 is a flowchart of a heiuistic, referred to as Heuristic A, used 
to form an overlap graph describiiig potential label placements; 

Figure 12 is a flowchart of a heuristic,- referred to as Heuristic B, used 
to form an overlap graph describing potential label placements; 
25 Figure 13 is a flowchart of a heuristic, referred as the Heuristic C, used 

to reduce an overlap graph to a matching graph; 

Figure 14 is a flowchart of a heuristic, referred to as Heuristic D, used 
to find a minimum-weight, maximum cardinality matching of the matching 
graph; 

30 Figure 15 is a flowchart of an overall algorithm in accordance with the 

present invention; 
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Figure 16 is an example of an orthogonal graph layout; 
Figure 17 is an example of a hierarchical graph layout; 
Figure 18 is an example of a symmetric graph layout; and 
Figure 19 is an exan^le of a labeled graph drawing produced using the 
methods described herein. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The present invention, in a preferred embodiment, may be embodied in 
software as part of a Graph Layout Toolkit of a type sold by the present 
assignee. The Graph Layout Toolkit is a family of portable, automated, 
object-positioning libraries designed for integration into Graphical User 
Interface (GUI) application programs. It makes graphs more readable by 
choosing logical (x,y) positions for nodes and a suitable routing for edges. The 
Graph Layout Toolkit functions as a software component. Since it does not 
include any graphics code, it may be used with any GUI toolkit or windowing 
system. The Graph Layout Toolkit of the assignee adheres to modem 
object-oriented programming standards and C -H + standards-based development, 
and is therefore portable to any environment with a C++ compiler supporting 
the ANSI C + + specification. An ANSI C API is also provided for C 
application programs: it parallels the capabilities of the interface provided as 
part of the C+ + API. The Graph Layout Toolkit is not dependent on any 
particular hardware platform. Since most compilers for other languages 
provide mechanisms to make calls to the C language, the ANSI C API allows 
programmers to mstall the Graph Layout Toolkit with most application 
programs. Of course, the methods of the present invention may be practiced in 
software, in any programming language, or in hardware. The software 
embodiment described herein is simply one convenient implementation. 

Referring to Figure 2, a computer system which may be used to 
implement the present design techniques is shown, having a monitor 210, a 
keyboard 220 and a mouse 230. A mouse cursor 240 appears on the screen and 
moves as the mouse is moved. The internal hardware of the computer 
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workstation of Figure 2 may be represented in simplified block diagram form as 
shown in Figure 3. The computer system has a central processor 302, a system 
memory 304, a display 306, a printer 308, a keyboard 310, a pointing device 
321, a disk storage subsystem 314, and I/O controller 316, and interconnecting 
5 means such as a system bus. The computer workstation may be part of a larger 
network of computers interconnected by a local area network (LAN), for 
example. The computer configuration described should be understood as being 
merely exemplary of a wide variety of computer configurations and 
architectures in which the present mvention may be employed. 

10 < Labeling Quality > 

The GFLP problem tries to assign placements to (text) labels for a 
predefined set of graphical features of a drawing, such that each label 
communicates information about its corresponding graphical feature in the best 
possible way. There are three basic rules that, in general, give an accurate 

15 assessment of the quality of a label assigimient: 

1. No overlap of a label with other labels or other graphical features of 
the layout is allowed. 

2. Each label can be easily identified with exactly one graphical feature 
of the layout (i.e., the assignment is unambiguous). 

20 3. There is a ranking of all the label placements for every graphical 

feature. 

The first two rules are used to evaluate, the clarity of the association of 
the labels to their corresponding graphical features. The third rule is used to 
evaluate the overall quality of a given label assignment with respect to the 

25 ranking of the label placements. The ranking of the potential label placements 
for a graphical feature typically captures an aesthetic preference with regard to 
label positioning, an essential criterion for the labeling quality of geographic 
(and to some extent technical) maps. It also allows the introduction of 
problem-specific constraints that are specific to technical maps and layouts. 

30 E.g., for the labelmg of the edges of a graph layout, a problem-specific 
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constraint might be that the label of an edge must be closer to the source (or 
destination) node. 

Since the drawing is fixed, there are instances where even an optimal 
labeling solution might produce label assignments that do not strictly follow the 
5 above rules. In this case, there should be a way of evaluating how good a 

labeling solution is. This evaluation is made possible by assigning to each label 
placement a quantity that evaluates the quality of that label placement. This 
quantity is called COST. COST(i,j) is a fiinction diat gives the penalty of 
having assigned label placement j to graphical feahire i in the final labelmg 
10 solution. The COST for each label placement j is a linear combination of: 

1 . The penalty with respect to the ranking of j; and 

2. The penalty which reflects the severity for j of the violation of the 
first two basic rules. 

< Label Placement > 
15 In order to completely describe the labeling problem, one must define 

how to construct the set of potential label placements for any graphical feature. 
The labeling of the edges of a drawing of a graph will be used as an example to 
illustrate some of the basic definitions necessary for this method. 

First, rules must be specified that interpret the "Basic Rules For 
20 Labeling Quality, " since for different graphical features, different rules can 
apply. In the case of the ELP problem, those rules might be: 

1. A label placement respects the first rule if it does not overlap any 
graphical feature. It is allowed to be adjoined to the edge that it belongs to. 

2. A label placement respects the second rale if it touches the edge that 
25 it belongs to. 

In Figure 1(a), label placements A, B, C, D and E are potential label 
placements for the edge (1, 2). Label placements A, B and D follow the first 
two rules, and hence their cost depends only on their ranking. Label placement 
C intersects the edge that it belongs to, which is a violation of the first rule. If 
30 a drawing has multiple edges with labels like E floating between edges, it is 
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impossible to have a clear understanding of which edge that label belongs to. 
Therefore, the second rule must be followed such that labels like E are avoided 
in a final labeling solution. 

The set of all potential label placements for a given edge can be defined 
5 in the discrete or continuous labeling space. In the discrete labeling space, the 
set of all potential label placements is finite, and each label placement is 
identified by its position in the layout, as exemplified in Figure 1(a). In the 
continuous labeling space, the set of all potential label placements is infinite. 
For each edge there is defined a region which is bounded by a closed line, such 

10 that each potential label placement for that edge is contained within that region. 
Candidate label placements are label placements that have at least one 
intersection point with their associated edge. By imposing this restriction, 
labels like E in Figure 1(a) are avoided. 

The shaded region around edge (1, 2) in Figure 1(b) is the continuous 

15 labeling space for that edge, and any label placement inside that shaded region 
is a candidate label placement for that edge. 

One major difference between the PLP (Point Label Placement) and ELP 
(Edge Label Placement) problems is in the definition of a legitimate potential 
label placement. First, a potential label placement is a label placement that can 

20 be part of a final labeling solution. In the PLP problem, the only potential 

label placements are label placements that do not overlap any graphical feature 
of the layout except other label placements. But in the ELP problem, potential 
label placements include not only label placements that follow the same rules as 
in the PLP problem, but also label placements that overlap their associated 

25 edges. 

In Figiure 1(a), label placements like A, B and D are preferable; but 
certainly label placements like C, which overlaps its associated edge, are 
acceptable, provided that the appropriate cost is charged. 

< Formulation Of The GFLP Problem > 
30 The GFLP problem is an optimization problem, since the objective is a 

labeling solution of minimum cost. Each label placement in a final solution 
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carries a penalty calculated by the COST function. The objective is to find a 
set of label placements, one for each graphical feature, that yields minimum 
total cost. 

THE OPTIMAL ELP PROBLEM: Let G be a drawing, and let E be the 
5 set of aU graphical features to be assigned label placements. Also, let W be the 
set of all potential label placements for any given graphical feature in E, and let 
k be the number of labels per graphical feature. 



PROBLEM: Find a labeling solution that minimizes the following 



function: 




where 



. 1, if label placement j is assigned to grapiiical feature i,. 
0, otherwise 



'} 



and 



2 Pi^i) for 0<i<|£j + l 
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< Solving The GFLP Problem > 
The GFLP problem can be viewed as the problem of finding a matching 
label placement for each graphical feature, v/here each label placement is free 
of overlaps. Label placements may be matched to graphical features by 
5 formulating the GFLP problem as a graph matching problem. This graph 

matching problem will be the first phase of the present algorithm that solves the 
GFLP problem. 

It is advantageous to group label placements together on the basis of 
their mutual interdependence. Each label that belongs to a group can 

10 potentially overlap at least one other label within the group. This idea may be 
further restricted by introducing the following relation R. 

RELATION R: For any potential label placements x, y, if x R v then V 
z, where z is a potential label assignment, z overlaps x if and only if z overlaps 
y as well. This relation restricts the number of qualified label placements that 

15 can be included in a group. Each group is actually a class for relation R. 

Furthermore, each label that forms a single class in R does not, by definition, 
overlap any other label. A set W of potential label placements for respective 
graphical features of the input drawing is constructed by reducing the size of 
the set Q of all potential label placements of graphical features in the drawing. 

20 Any label in the set W has the property that it belongs to exactly one 

class of relation R, and ia a final labeling solution (with no overlaps of label 
placements) only one label placement of each class of relation R can be part of 
the final solution. 

By introducing relation R, the GFLP problem may be formulated as a 
25 maximum-cardmality, minimum-weight matching problem of a bipartite graph 
Gmatching (VGF»Vhieictass>Emrtchtag) where: 

1) Each node in Ygp corresponds to a graphical feature of the drawing; 

2) Each node in VbbeK;hiss corresponds to a class of relation R; and 

3) Each edge i in E^^taang connects a node j in (graphical feamre j 
30 of the drawing) to those nodes v, in Viabdciass that have as members label 
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placements of j. The weight of each edge i is the ranking of the label 
placement (y^. 

An example of a matching graph is shown in Figure 4. A Ime divides 
the graph into two portions such that each edge in the graph crosses the line. 
5 This is the defining feature of a bipartite graph. Nodes in the upper half of the 
graph represent graphical features with respect to which labels are to be placed. 
Each node in the lower half of the graph represents a class of conflicting 
(overlapping) label placements. Only one label placement from each class can 
be part of the fmal labeling solution. An edge between a node in the upper half 

10 of the graph and a node in the lower half of the graph denotes that the class 
represented by the node in the lower half of the graph contains one or more 
acceptable label placements for the graphical featore represented by the node in 
the upper half of the graph. A node in the lower half of the graph may have 
multiple incident edges, each from different nodes in the upper half of the 

15 graph, denoting that the class represented by the node contains acceptable label 
placements for each of multiple graphical features. Since only one label 
placement may be chosen from among the class represented by the node, all but 
one incident edge of the node must be eliminated in the course of finding a 
labeling solution. 

20 The optimal labelmg solution with respect to the set W of label 

placements may be obteined if a maximum-cardinality, mmimum-weight 
matching of the matching graph may be found. Cardinality refers to the 
number of edges of the graph, i.e., the number of graphical features for which 
a label placement has been found. In the maximum-cardinality, 

25 minimum-weight matching, therefore, the maximum possible number of nodes 
in the upper half of the graph each have a single edge connected to a respective 
node in the bottom half of the graph, where the total cost of the set of edges, 
for a given cardinality, is minimized. 

One major factor in the feasibility of the foregoing approach is the 

30 construction of the set of label placements, W. As previously mentioned, each 
of these label placements belongs to exactly one class of relation R. At the 
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beginning there can be an infinite number of label placements for each 
graphical feature. 

The first step is to reduce the niunber of label placements to a smaller 
set Q such that each graphical feature has a finite number of label placements 
5 associated with it. This reduction is achieved by working with the discrete 
labeling space. Also, each label placement is associated with exactly one 
graphical feature. 

The finite set of label placements may be described by an overlap graph 
GJOV} (V_{OV}, E_{OV}) where: 
10 1. Each node in V_{OV} corresponds to either a graphical feature of a 

label placement; 

2. Each edge in E_{OV} either connects a node corresponding to a 
label placement to a node corresponding to an edge that this label is assigned 
to, or connects two nodes that correspond to label placements that overlap. 

15 The overlap graph serves as a model to described the set Q of label 

placements. The final goal in the first phase of the algorithm is to reduce the 
initial set of label placements Q to a smaller set of label placements W such that 
W is one side of the matching graph. Finding the set W of label placements 
allows the matching graph to be constructed. Because the matching graph is a 

20 bipartite graph, polynomial algorithms exist to find the maximum cardinality 
minimum weight matching of that graph. 

The matching yields ttie optimal solution for the GFLP problem on the 
set W of label placements. Clearly, once the matching graph as been obtained 
(implying that the set W has been defined), the problem thereafter can be 

25 solved optimally. But the GFLP problem is NP-hard, indicating that it is the 
creation of the overlap graph and the its reduction to the matching graph that 
boimds the solution space of the GFLP problem in such a way that the new, 
restricted solution space can be explored in polynomial time. The following 
very important conclusion may therefore be drawn with respect to the GFLP 

30 problem and the methods described herein: 
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1. Once the matching graph is computed, then the GFLP problem can 
be solved optimally in polynomial time. 

2. There cannot be a polynomial-time algorithm for creating the 
matching graph (from which the optimal solution to the GFLP problem can 

5 always be found). 

Hence, the main steps for reducing the complexity of the GFLP problem 
are (a) the construction of the overlap graph, and (b) the reduction of the 
overlap graph to the matching graph. 

The following description presents a framework for reducing the 

10 complexity of the GFLP problem along with examples of heuristics that can be 
used within the framework. Referruig more particularly to Figure 5, given an 
input layout, first an overlap graph is constructed that accounts for overlap of 
potential label placements. An example of an overlap graph is shown in Figure 
8(b). Nodes in the upper half of the graph represent graphical features, e.g., 

15 edges. Nodes in the lower half of the graph represent individual potential label 
placements for those edges. Edges are used to associate graphical features with 
their potential label placements. Overlaps between respective pairs of potential 
label placements are represented by respective edges between corresponding 
nodes in the lower half of the graph. 

20 The main algorithm for solving the GFLP problem may have two 

phases: 

In Phase 1, the set W ^ Q of potential label placements for the iiq)ut 
drawing G is built. Next, there is found a labeling solution with respect to that 
set W of label placements by solving a variant of the matching problem for 

25 bipartite graphs. 

In Phase 2, heuristics are used to improve the solution created in Phasel 
by looking for solutions within the enture solution space, and by removing some 
of the restrictions with respect to the quality of the label placements. The 
Phase 2 algorithm may be based on local improvement techniques, guided 

30 depth-first search with backtracking, etc. The goal of Phase 2 is to improve the 
size of the solution obtained in Phase 1. 
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The overlap graph can be constructed several ways. For example. 
Heuristic A, described hereinafter in relation to Figure 11, or Heuristic B, 
described hereinafter in relation to Figure 12, can be used to construct the 
overlap graph. In a special case to be described, the overlap graph is identical 
5 to the desired matching graph. In the general case, the overlap graph is 

transformed to the matching graph using Heuristic C, described hereinafter in 
relation to Figure 13. 

The matching graph can be constructed in several ways. For example. 
Heuristic C, described hereinafter in relation to Figure 13, can be used to 
10 construct the matching graph. 

In general, known methods may be used to find the 
maximum-cardinality, minimum-weight matching of the bipartite matching 
graph. In the special case referred to above, a fast solution to this problem 
may be obtained using Heuristic D, described hereinafter in relation to Figure 
15 14. 

The steps described thus far correspond to Phase 1 of the algorithm. If 
after Phase 1 a placement has been found for all graphical features, then the 
program concludes. If not, then the program enters Phase 2. In Phase 2, 
conventional backtracking techniques may be used in an attempt to further 

20 improve the solution. 

The overall program flow, including both Phase 1 and Phase 2, is 
depicted in Figure 15. The program receives as its input a drawing G having a 
set of graphical features GF. The program begins by executing Phase 1. In 
Step 1 thereof, an overlap graph Gov is computed for G. The results of Phase 

25 1 depend upon the set of discrete labels used to form the overlap graph. Phase 
1 may therefore be repeated in a loop until a complete labeling solution is 
found, until a partial but sufficient labeling solution is found as defined by a 
user-set threshold, or until a number of iterations have been performed. In the 
first iteration, the maximum number of potential label placements is used. In 

30 . each succeeding iteration, potential label placements are removed from the set. 
For example, label placements that produce the most overlaps may be removed. 
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Label placements may be removed if they belong to graphical features that, 

following removal, will still have a number of label placements associated with 

them in the overlap graph. 

In Step 2, the overlap graph is reduced to a matching graph. In Step 3, 
5 the matching graph may be further reduced, if specified, in a manner described 

in greater detail hereinafter. 

In Step 4, the matching algorithm is run on the matching graph. If a 

complete labeling solution has been found, if a partial but sufficient labelmg 

solution has been found as defined by a user-set threshold, or if a specified 
10 number of iterations have been performed, then the loop is exited and, if 

necessary, Phase 2 processing is begun. 

In Step 1 of Phase 2, a backtracking algorithm is run. Running the 

backtracking algorithm may allow a complete labeling solution to be found. In 

Step 2, if there are still graphical features with no label placement assigned, 
15 then the restrictions on the overlapping of labels with graphical features may be 

relaxed. For example, some percentage of overlap, say 5% or 10%, may now 

be deemed acceptable. The backtracking algorithm is then run again. Finally, 

in Step 3, if there are still graphical features with no label placement assigned, 

then different styles of labels may be used m order to facilitate placement, as 
20 shown in Figure 10(a), Figure 10(b) and Figure 10(c). Another possibility is to 

simply choose label placements for the remaining features with less 

consideration of label quality. 

An overview of the present labeling algorithm having been presented, 

various steps within Phase 1 and Phase 2 of the algorithm will now be 
25 examined in greater detail, m particular the heuristics used to form the overlap 

graph, reduce it to a matching graph, and fmally fmd a maximum-cardinality 

matching for the matching graph. 

In Phase 1, the basic idea is to restrict the set of potential label 

placements for drawing G to a set W such that each label placement in that set 
30 belongs to a class of relation R. This problem may be decomposed mto two 
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subproblems: First create the discrete labeling space; Second, reduce that space 
to set W. 

To create the discrete labeling space, for each graphical feature i in G, 
its continuous labeling space is de&ied, denoted as LS(i). Any label of i must 
5 be entirely inside LS(i). The continuous labeling space for each graphical 
feature is reduced by not allowing label placements to overlap any graphical 
feamre of the layout (except another labeling space). This can be done easily 
by removing any part of a labeling space that overlaps a node or an edge. 
This reduction splits the continuous labeling ^ace for each graphical 

10 feature of G into a set of mutually disjoint regions. Those regions for graphical 
feamre i are denoted as SubLS(i,j) where 0 < j < k + 1, and k is the number 
of the disjoint regions of i. The union of all of the mumally disjoint regions of 
a graphical feature i is the reduced labeling space for graphical feature i, 
denoted RLS(i) = Uj SubLS(i,j). 

15 Reducing the labeling space for each graphical feamre constimtes the 

first step towards finding a discrete labeling space for the input layout. This 
reduction of the labelmg space has the following interesting property: It clusters 
together regions of different graphical features such that a label placement of a 
given graphical feature which belongs to a given region can overlap only label 

20 placements of some other region of the same cluster. This reduction allows 
only high quality label placements to be considered in the Phase 1 of the 
algorithm, because each label placement in the final assignment of Phase 1 will 
be free of overlaps. Also, by dividiog the drawing G into clusters, the search 
space for potential conflicts (overlaps) is significantly reduced. 

25 The example of labeling the edges of a drawing of a graph (the ELP 

problem) will be used to illustrate the reduction concept. 

Given an input graph G, if the layout of G is planar (i.e, has no edge 
crossings), then the faces of that planar graph are the "clusters" of G. If the 
layout has crossings, then a planar graph Gclusters is constructed by 

30 introducing for each crossing a virtual node. The resulting graph is planar, and 
the faces of Gclusters are the clusters of G. Figmre 6(a) shows the layout of an 
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original graph G. Figure 6(b) shows a graph Gclusters which has been 
obtained from G by introducing virtual nodes, which are denoted by squares. 
Finally, in Figure 6(c), the shaded regions are the clusters of G. 

< Heuristics To Form The Overlap Graph > 
5 An overlap graph, denoted G^(y^,E^), has a structure that describes a 

discrete labeling space for a drawing and any overlaps of label placements that 
can occur. Introducing the overlap graph concept allows the reduced 
continuous labeling space to be mapped to a discrete labeling space, a step 
which is necessary to construct the bipartite matching graph, denoted as 

There are several ways to construct an overhip graph. Two simple 
heuristics will be described. 
HEURISTIC A 

Heuristic A first determines if the reduced labeling space of each 
15 graphical feature in the drawing overlaps the labeling space of other graphical 
features. Then it divides the labeling space of each graphical feature into three 

sets: 

1) The set NOV of all disjoint regions of the labeling space that do not 
overlap the labelii^ space of any other graphical feature. 
20 2) The set 20V of all disjoint regions of the labeling space that overlap 

the labeling space of exactly one other graphical feature. 

3) The set MOV of all disjoint regions of the labeling space that overlap 
the labeling space of at least two other graphical features. 

This partitioning of the solution space identifies the areas of the drawing 
25 where conflicts of label assignment can occur, and from that partition the 

structure of the overlap graph is revealed. This partition of the solution space 
identifies the areas of the drawing where conflicts of label assignment can 
occur, and from that partition the structure of the overlap graph is revealed. 
For example, if for some graphical feature a member of the set of NOV regions 
30 can include a label for that graphical feature, then there exists at least one label 
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placement that can be assigned to that graphical feature that is not in conflict 
with any other label placement of the drawing. 

There are two cases, however, where it is not immediately clear how the 
overlap graph is to be constructed: a region may not be large enough to include 
5 a label, or a region might be large enough to include more than one label 

placement. In the latter instance, if the region is a NOV region, then the label 
placement is chosen that has the highest ranking among all possible label 
placements for that region, or, if more than one label is allowed per graphical 
feature, then some number of the highest rankmg label placements is chosen, up 

10 to the maximum number of label placements for that region. If the region is a 
20V or MOV region, then a number of label placements is assigned to each 
graphical feature that is on the border of that region. 

In the former case where the region is not large enough to accommodate 
a label placement, that region is incorporated into to some other adjoining 

15 region of the same edge. Obviously, if a NOV region is combined with a 20V 
or MOV region, then the new region is 20V or MOV. Similarly, if a 20V 
region is combined with an MOV region, then the new region is MOV. 

Figure 11 gives a more detailed description of the algorithm of 
HEURISTIC A. The algorithm may best be appreciated from an example of 

20 the application of the algorithm to labeling of the edges of a layout of a graph 
(the ELP problem), such as the simple graph of Figure 7(a) having nodes 1, 2 
and 3 and edges (1,2), (2,3) and (1,3). 

Referring first to Figure 11, in Step 1, the partitioning described above 
is performed. For each cluster, each edge that is a part of the border of the 

25 cluster is considered in turn. The labeling space of that edge is divided into the 
three types of regions, NOV, 20V and MOV. Figure 7(a) shows how a cluster 
of a drawing is partitioned into the three types of regions. The labels of each 
edge for which label placements are to be found are shown in the left-hand 
portion of Figure 7(a). Within the cluster, the continuous labeling space for 

30 edge (1,2) may be imagined as being obtained by, in effect, sliding the label for 
edge (1,2) along the edge in its various possible orientations. The continuous 
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labeling space for edge (1,2) is shown by a solid line, the continuous labeling 
space for edge (2,3) is shown by a heavy dashed line, and the continuous 
labeling space for edge (1,3) is shown by a light dashed line. NOV, 20V and 
MOV regions are identified based on overlap of these continuous labeling 
5 spaces. 

These regions are shown by light dashed lines that, for clarity of 
illustration, have been extended outside the cluster. In the example of Figure 
7(a), the size of the labels is such that the innermost regions of the cluster does 
not belong to any of the continuous labeling spaces. Hence, in Figure 7(a), 

10 there are three NOV regions and three 20V regions but no MOV regions. 

These regions can be refined further according to constramts on the orientation 
of the labels. Figure 7(b) shows the refined partitioning of the labeling space 
when the labeling solutions are constramed to be parallel to the horizontal 
direction. This refined partitioning will be assumed for purposes of the 

15 remainder of the description of HEURISTIC A. 

Referring again to Figure 11, in Step 2, building of the overlap graph 
starts by, for each edge in the original graph, introducing a node representing 
that edge into the overlap graph. Then, as presently described, the discrete 
labeling space for each graphical feature is determined, with a node being 

20 introduced for each label position, joined by an edge to its corresponding 
graphical feahire. 

For purposes of the present description, recall that SubLS(i,k) is a set of 
k mutoally disjoint regions (sub-labeling spaces) of a graphical feature i, and 
RLS(i) (the reduced labeling space) is the union of such regions. For each 

25 cluster in the original graph layout, each edge is considered in turn. Fust the 
NOV portions within the cluster for the edge i are considered. If an NOV 
portion can accommodate at least one label placement, then a node for each 
such label placement is introduced into the overlap graph, joined by an edge to 
the node representing the edge i. The assignment of the set of potential label 

30 placements for each edge of the cluster after applying HEURISTIC A is shown 
in Figure 8(a), whereas Figure 8(b) shows die portion of the overlap graph 
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resulting from that cluster. Hence, referring to Figure 8(a), it is determined 
that, for edge (1,2), a label placement identified by the number 3 may be 
accommodated withm the NOV region for edge (1,2). A node 3 is placed in 
the overlap graph, shown in Figure 8(b), joined by an edge to the node (1,2), 
5 representing the edge in the original graph. 

If the region cannot accommodate a label placement, then the algorithm 
attempts to join the region to, in order of preference, an adjoining 20V region 
or an adjoining MOV region. If the region cannot be joined to an adjoining 
region, then the region cannot be used in the labeling solution and is deleted 

10 from the reduced labeling space. 

A similar sequence of steps is then performed for 20V regions of the 
edge i. Notice that in Figure 8(a), a determination has been made that the 20V 
region for edges (1,2) and (1,3) can accommodate two label placements, 
numbered 1 and 2, for the edge (1,2). Nodes 1 and 2 are therefore introduced 

15 into the overlap graph, Joined by edges to the node (1,2). If a 20Y region 
cannot accommodate a label placement, then the algorithm attempts to join the 
region to an adjoining MOV region, else the region is deleted from the reduced 
labeling space. 

A similar sequence of steps is then performed for MOV regions. If an 
20 MOV region cannot accommodate a label placement, then the region is shnply 
deleted. 

Finally, in Step 3 of Figure 11, all of the conflicts between the label 
placements determined in Step 2 are found. If two label placements overlap, 
then they are joined by an edge in the overlap graph. In the example of Figmre 
25 8(a), therefore, overlap between label placements 1 and 10 and between label 
placements 4 and 6 is indicated by adding respective edges between these label 
placements within the overlap graph of Figure 8(b). 

HEURISTIC B 

Heuristic B assigns to each graphical feature in G a number of discrete 
30 label positions in the reduced labeling space, and thus produces a discrete 

labeling space to be described by the overlap graph. Then the original graph 
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drawing is scanned to find potential conflicts (overlaps), and this infonnation is 
stored in the overlap graph. In the reduced labeling space in accordance with 
HEURISTIC B, only overlaps of label placements of different graphical 
features are possible. 

5 Figure 12 gives a more detailed description of this algorithm. The first 

portion of Step 1 of HEURISTIC B is the same as described previously for 
HEURISTIC A. Step 2 of HEURISTIC B is also the same as described 
previously for HEURISTIC A. HEURISTIC B differs from HEURISTIC A in 
the manner in which label placements are determined. 
10 In particular, in HEURISTIC B, for each edge in the original graph and 

for each sub-labeling space of that edge, a number of discrete labels are 
assigned to the edge. For each label so assigned, a node is introduced into the 
overlap graph. 

Using again as an example the labeling of the edges of a layout of a 
15 graph (the ELP problem) to illustrate HEURISTIC B, Figure 9(a) shows how in 
Stepl of HEURISTIC B discrete labels may be assigned by dividing a cluster 
by horizontal lines. Briefly, a cluster is divided by introducing equally spaced 
strips. The strips are arranged such that the height of some number, one or 
greater, of consecutive strips is equal to the unit height of the labeling solution. 
20 These strips divide the layout into equal height horizontal segments, and any 
potential label placement must lie entirely within a set of consecutive horizontal 
segments. 

Three sets of strips are shown. The first set of strips is for edge (1,2). 
These strips extend the length of edge (1,2) and are indicated in solid lines. 
25 The second set of strips is for edge (2,3) and is indicated in heavy dashed lines, 
and the third set of strips is for edge (1,3) and is indicated in light dashed lines. 
These two sets of strips extend only along the length of their respective edges, 
which in this example project onto different segments of the edge (1,2). 

Figure 9(b) shows the numbered label placements for each edge subject 
30 to the constraint that each label is contained within a number of unit-height 
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horizontal segments (in this example, one segment). Figure 9(c) shows the 
portion of the overlap graph resulting from that cluster. 

Note that in both HEURISTIC A and HEURISTIC B, in identifying a 
discrete labeling space, either all discrete label placements that follow a given 
5 rule may be chosen, or a subset of those label placements may be chosen. 

Therefore, in the first phase of the method, a loop can be inserted (as described 
previously in relation to Figure 5) that can help find a better set of discrete 
label placements by observing the size of the labeling solution assigned in the 
matching section of the first phase of the method. 

10 < Heuristics To Form The Matching Graph > 

In order to speed up the search for a labeling solution, the matching 
graph Gn,(Vn^,V^,EJ may be obtained by reducing the overlap graph 
Gov(Vov.Eov) to it. As has been noted, the matching graph is a bipartite graph 
such that each node in is a graphical feature of drawing G, and each node 

15 in is a class of relation R. Generally speaking, the overlap graph is 

reduced to a matching graph by deleting a number of label placements such that 
the rest of the label placements belong to exactly one class in R. 

The set of nodes of can be partitioned into two subsets, Vove and 
VovL where each node in V^ve is an edge in the original graph, and each node in 

20 VovL is a label. The goal is then to further partition the set of nodes m VovL i^to 
subsets such that each subset will be a node in V^. Each node in is a 
class of relation R. Let GR(V,^e„Eco„„in) be a subgraph of the overlap graph 
obtained by removing all nodes in Vove and their incident edges. The graph Gr 
represents the overlaps conflicts of the set of label placements W. 

25 There can be as many classes in R as the number of connected 

components in Gr. A connected component in Gr contains the members of a 
class if and only if it is a complete graph. If any of the connected components 
is not a complete graph, then the objective is to break it into new coimected 
components by removing one or more nodes and their incident edges. This is 

30 repeated until all the connected components are complete graphs. Then, each 
connected component is represented by a node in and is joined by edges in 
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En, to the nodes in V„,e to which label placements that are part of that connected 
component belong. 

The best reduction is the one that eliminates those nodes of Gr that do 
not affect the cardinality of the maximum matching of the bipartite graph G,„, 
5 since the maximum-cardinality minimum-weight matching of will give an 
optimal labeling solution with respect to the set of label placements W. Since 
this part of the GFLP problem is very likely NP-hard, a heuristic is used to 
reduce the overlap graph to the matching graph. 

Figure 13 describes in detail an algorithm, HEURISTIC C, that reduces 

10 the overlap graph to a matching graph. In Figure 13 and elsewhere in the 

present specification, a node v that represents an edge is denoted as Vg; a node 
Ve representing edge i is denoted as Vg;. A node that represents a label is 
denoted as v^; a node Vl representing a label of edge i is denoted as Vli. 

Referring to Figure 13, in Step 1, the graph Gr is built by first setting 

15 Gr to the overlap graph and then removing the nodes in the upper half of the 
graph, together with their incident edges. In Step 2, building of the matchmg 
graph Gn, is begun by adding a node in the upper half of the matching graph for 
each graphical feature (e.g., edge) in the original graph. 

It may be that there is only one possible label placement for a particular 

20 graphical feature, or edge. Choosing from a single possibility is easy. 

Therefore, in Step 3, if there is only one label placement node in the graph Gr 
that corresponds to edge i, then any nodes incident to this node (i.e., nodes for 
all label placements that conflict with this label placement) are removed. In the 
following Step 4, if a label placement node has degree zero, mdicating there 

25 were no conflicting label placement nodes or that such nodes have been 

removed, then a corresponding label placement node is added to the matching 
graph along with a connecting edge to the node representing the appropriate 
graphical feature. The label placement node having degree zero is removed 
from the graph Gr. 

30 . Then in Step 4, the connected components in the graph Gr are checked 

to see if they are complete graphs. A connected component that is a complete 
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graph represents a set of label placements from which, because every pair of 
label placements chosen form the set conflicts, one label placement may be 
chosen as part of the final labeling assignment. For each connected component 
that is a complete graph, therefore, a node is added to the matching graph and 
5 edges are added from that node to each graphical feature to which a label 
placement within the set corresponds. The coimected component is then 
removed from the graph Gr. 

In Step 5, only connected components that are not complete graphs 
remain. The objective is to selectively remove ("throw away") label placement 

10 nodes in order to form connected components that are complete graphs. Of 

course, the label placement nodes that are thrown away should be those that are 
least needed or least useful. If in the matching graph a node representing a 
graphical feature (e.g, edge) is already connected to at least two nodes of 
degree two (each representing sets of conflicting label placements), and if a 

15 node representing another graphical features is also connected to the same two 
nodes, then label assignments for both of the graphical feamres can be made: a 
label placement from one of the nodes of degree two can be assigned to one of 
the graphical features, and a label placement from the other node of degree two 
can be assigned to the other graphical feature. Hence, in a connected 

20 component that is not a complete graph, a label placement node that 

corresponds to either of the two graphical features in the case described above 
can be eluninated as imnecessary. It is removed from the graph, and program 
flow returns to Step 3. 

At some point, every instance of this special case will have been 

25 detected and the unnecessary label placement nodes elimmated. At this point, 
nodes representing label placements that are least usefiil are eliminated. In 
particular, if a label placement node is of high degree, then the label placement 
overlaps with a large number of other label placements. If in the connected 
component under consideration there is a node having a higher degree than all 

30 remaining nodes in the connected component, then that node is removed. 

Otherwise, a node is randomly removed from among a group of nodes in the 
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connected component that are tied for having the same highest degree. For 
example, if a connected component had nodes of degree 3, 2 and 1, then the 
node of degree 3 would be removed. If the connected component had nodes of 
degree 3, 3 and 1, then a random one of the nodes having degree 3 would be 
5 removed. Program flow then returns to Step 3. 

< Finding The Maximum Cardinality Minimum Weight Matching > 
One can prove that a maximum-cardinality minimum-weight matching of 
the matching graph Gjy^, W^, E J will give an optimal labeling solution 
with respect to the label placements in set W that are members of relation R. 
10 The size of the matching graph depends on the size of set W and the density of 
conflicts (overlaps). The size of the matching graph is a critical element m 
fmding the matching, since it takes quadratic time to And the 
maximum-cardinality minimirai weight matching of a graph. 

The following measures may be taken in order to fiirther reduce the time 
15 complexity of the matching: 

1) As a preprocessing step, apply Steps 1 and 2 of the algorithm in 
Figure 14, described below. In most cases, this reduces the size of the 
matching graph, after which a known matching algorithm may be applied. 

2) Find only the maximum-cardinality matching of the matching graph, 
20 which will give a labeling solution of the same size as the optimum solution, 

but in most cases with less quality. 

3) Reduce the matching graph further such that each node in 
corresponds to at most two graphical features of the original graph. The graph 

then has that property then each node in will have at most degree two. 
25 For such bipartite graphs, the algorithm of Figiure 14 runs in linear time and 
produces a maximum-cardinality matching. Furthermore, the algorithm 
guarantees that each node in will in the worst case be matched with the 
second-best (with respect to weight) incident edge. 

Let G'(Ve,Vl,E) be a bipartite graph such that each node in Vl has 
30 . degree at most two and each edge in E has a weight assigned to it. Then, the 
algorithm in Figure 14, HEURISTIC D, finds the maximum cardinality 
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matching for G'. Steps 1 and 2 deal with nodes in the matching graph of 
degree one. If a node Vlj has degree one, then the node represents a single 
label placement with no overlaps. It may be assigned to a graphical feature 
without any conflict, if it is the best label placement for that graphical feature. 
5 Hence, in Step 1, if the edge connecting the label placement node to the 
graphical feature node is the minimum-weight incident edge of the graphical 
feature node, then a match is assigned and recorded. The matching graph is 
then updated by removing the two nodes and their incident edges. 

In Step 2, if a node representmg a graphical feature has degree onci then 

10 that graphical feature only has one chance to receive a label assignment. The 
corresponding label placement is therefore assigned as a match and recorded, 
and the matching graph is updated. This step is repeated until no further 
matches are made. 

After Steps 1 and 2, the nodes representing graphical features that 

15 remain in the matching graph are of degree other than one, i.e, degree zero or 
degree two or more. Nodes of degree zero are deleted, since no match is 
possible. Then, for the remaining nodes of degree two or more, the two edges 
of least weight incident to each of these nodes are kept and the remaining edges 
are deleted. Each node representing a graphical feature now has degree two, 

20 and each node representing label placements has degree at most two. The 

remaining bipartite graph has an interesting structure. It is a simple cycle or a 
simple path. Any such graph has exactly two maximum-cardinality matchings. 
Since the number of edges is even, these matchings may be found by simply 
traversing the cycle or path and choosing as part of the matching only the 

25 even-numbered edges or only the odd-numbered edges. Of the two matchings, 
the matching that has the minimum weight is then chosen. 

Different input graph layouts may have different graph layout styles, 
such as orthogonal, hierarchical and symmetric. Examples of graph layouts in 
orthogonal, hierarchical and symmetric style as shown in Figure 16, Figure 17 

30 and Figure 18, respectively. In these layout styles, where edges on the average 
are long and neighboring edges travel alongside to their destination nodes, their 
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matching graphs naturally resemble the structure of the input graphs for 
HEURISTIC D. That makes HEURISTIC D attractive not only because of its 
reduced tune complexity but also because of the solutions that it produces. 

A matching graph that satisfies the conditions of HEURISTIC D may be 
5 obtained by splitting the layout space into equally-sized horizontal segments and 
allowing each label placement to overlap at most one other label placement m 
the same horizontal segment. 

<Phase2> 

Phasel produces a labeling solution that does not guarantee assignment 
10 of a label placement to each edge of the original graph. When there are edges 
of the original graph without label placements assigned to diem after Phase 1, 
techniques based on local search and exhaustive search may be employed to 
find label assignments for those edges. Phase 2 explores the solution space in 
two ways: 

15 1) It searches the solution space to find if there is enough space, with or 

without repositioning the previously assigned label placements, to assign label 
placements to edges without label assignments such that the new label 
placements do not overlap any graphical feature or other label placement of the 
drawing. 

20 2) It searches the solution space by relaxing the restrictions on the 

quality of the label placements by allowmg label placements to overlap their 
associated edges. Repositioning is permitted only if the size of the labeling 
solution is increased by such action. 

The approach employed in Phase 2 is basically to do an exhaustive 

25 search. Any such approach is exponential in nature. Because of this 
exponential nature, a constant number of steps only is allowed for each 
backtracking. The approach may therefore be further characterized as 
exhaustive search with pruning. Unnecessary backtracking steps may be 
bounded by using the information obtained in Phase 1, which reveals the 

30 conflict areas of the drawing. 
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In some cases, either because the algorithm is not successful in finding a 
complete labeling solution, or because the drawing is very dense such that a 
complete labeling solution is not feasible, the two phases of the algorithm 
produce a labeling solution that leaves some edges with no label placement 
5 assigned to them. At this point, a trivial approach would be to scale up the size 
of the drawing to create more space for label placements, or to scale down the 
size of the labels. Either approach might be acceptable. If scaling is not 
possible, one attractive solution (especially for interactive programs) is to assign 
interactive icons to unlabeled edges that can pop-up a small window with the 

10 full-size label for that edge when that icon is clicked on. This solution does not 
really solve the problem, since there are many cases where an interactive 
approach is not feasible (i.e., the printout of a drawing). The best choice in 
such an instance may be to place the label of an edge somewhere in the 
drawing (preferably close to its associated edge) and have some connection 

15 (e.g., an arrow) that connects that label to its associated edge. Figure 10(a) 
and Figure 10(b), respectively, show examples of using an icon instead of a 
label. Figure 10(c) shows how a label can be put in open space and connected 
to its associated edge. Furthermore, the placement of "problem" labels can be 
forced, regardless of the overlaps that are produced. It is then left up to the 

20 user to decide what to do and to manually place the problem labels if necessary. 

An example of a labeled graph drawing produced using the foregoing 
method is shown in Figure 19. 

< Conclusions > 
Algorithms that solve the GFLP problem have been presented. 
25 Experiments have shown that this approach to solving the GFU problem is 
very successful. Since the GFIP problem is NP-hard, neither this algorithm 
nor any other polynomial algorithm can be expected to produce optimal 
solutions. Even though the ELP problem has been used to illustrate the 
different algorithms of this method, the method can be easily generalized in a 
30 straightforward manner to the label placement problem for a collection of 
graphical feamres of a drawing. 
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For clarity of description, the invention has been described in terms of 
placing only one label per graphical feature. It should be understood, however, 
that the same underlying method may be applied to positionmg any number of 
labels, of any size, per graphical feature. The solution framework presented 
5 herein is very flexible and acconmiodates the needs of the user by allowing die 
user to define a variation of the GFLP problem and to solve the problem using 
essentially the same method as presented in Figure 5. 

For example, one may want to find a labeling solution such that each 
graphical feature has more than one label associated with it. This may be 

10 accomplished by, mstead of find the maximum cardinality minimum weight 
matching of the matching graph, finding a maximum cardinality minimum 
weight "b-matching. " As is known in the prior art, a b-matching of a bipartite 
graph is a matching where, instead of allowing only one matched edge to be 
incident to each node in the upper half of the bipartite graph, a predefined 

15 number of edges (say, at most b) are allowed to be incident to each node in the 
upper half. This problem can be solved using known polynomial-time 
algorithms that solve the maximum flow minimum cost problem, or the b-factor 
problem. 

It will be appreciated by tiiose of ordinary skill in die art that the 
20 invention can be embodied in other specific forms wifliout departing from the 
spirit or essential character thereof. The presently disclosed embodiments are 
therefore considered in all respects to be illustrative and not restrictive. The 
scope of the invention is indicated by the appended claims rather than the 
foregoing description, and all changes which come within the meaning and 
25 range of equivalents tiiereof are intended to be embraced tiierein. 
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In the Claims: 

1. A method of assigning label positions to graphical features within 
a graphical layout, comprising the steps of: 

identifying a subset of potential label placements from among a 



2. The method of Claim 1, wherein the matching graph is a 

10 bipartite graph, nodes within an upper half of the graph representing graphical 
feamres and nodes within a lower half of the graph representing potential label 
placements. 

3. The method of Claim 2, wherein finding a high-cardinality 
matching of the matching graph comprises finding a maximum-cardinality 

15 matching of the matching graph. 

4. The method of Claim 3, wherein each label placement has a 
weight associated therewith. 

5. The method of Claim 4, wherein finding a high-cardinality 
matching of the matching graph comprises finduig a maximum-cardinality 

20 minimum-weight matching of the matching graph. 

6. The method of Claim 5, wherein each node within the lower half 
of the graph represents a set of label placements each pair of label placements 
of which overlap. 



5 



set of potential label placements; 

representing the graphical features and the potential label 
placements as nodes within a matching gr^h; and 

finding a high-cardinality matching of the matching graph. 



7. 



The method of Claim 6, wherein each node within the lower half 



25 



of the matching graph is of at most degree two. 
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8. The method of Claim 7, wherein identifying a subset of potential 
label placements comprises:dividing the graphical layout into a number of 
horizontal segments. 

9. The method of Claim 8, wherein each potential label placement 
5 occupies only one horizontal segment. 

10. The method of Claim 9, wherein each potential label placement 
overlaps at most one other potential label placement. 



11. The method of Claim 7, comprising the further step of, given a 
matching graph in which at least one node in the lower half of the matching 
10 graph is of degree greater than two, reducing the matching graph to obtain a 
matching graph wherein each node within the lower half of the matchmg graph 
is of at most degree two. 



12. The method of Claim 7, wherein finding a maximum-cardinality 
minimum- weight matching comprises traversing the lower half of the matching 

15 graph, selecting alternate edges. 

13. The method of Claim 11, comprising the steps of: 
selecting odd-numbered alternate edges and computing a 

corresponding weight; 

selecting even-number alternate edges and computing a 
20 corresponding weight; and 

finally selecting one of the odd-numbered alternate edges and the 
even-numbered alternate edges according to which selection gives the 
minimum weight. 



14. The method of Claun 6, comprising the further steps of: 
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representing overlaps of the potential label placements using an 
overlap graph; and 

building the matching graph using information derived from the 
overlap graph. 



5 15. The method of Claim 14, wherein nodes in the overlap graph 

represent label placements and edges between such nodes represent overlap 
between pairs of label placements. 



16. The method of Claim 15, wherein building the matching graph 
comprises: 

10 identifying within the overlap graph connected components that 

are complete graphs; and 

for each such complete graph, inserting a node in the matching 
graph and connecting to the node an edge for each graphical feature to 
which a label placement represented in the comected component 

15 corresponds. 



17. The method of Claim 16, comprising the fiirther steps of: 

selectively eliminating at least one node from the overlap graph; 

and 

repeating the steps of Claim 16. 



20 18. The method of Claim 7, wherein identifying a subset of potential 

label placements comprises: 

for a plurality of graphical features, identifying in accordance 
with at least one predetermined criterion areas within which labels for 
the respective graphical features may be placed; and 
25 based on the preceding step, identifying a plurality of regions in 

accordance with the nimiber of graphical features for which labels may 
be placed within each region. 
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19. The method of Claim 18, comprising the further steps of, for 
each region, in preferential order from regions having only one graphical 
feature for which labels may be placed within the region to regions having 
more than two graphical features for which labels may be placed within the 
5 region: 

determining a number of label placements for one of the 
graphical features that will fit within the region; 

if the number is zero, merging the region with an adjacent 
region; and 

10 if the number is non-zero, placing an equal number of nodes in 

the overlap graph. 
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PHASE 2: 
BACKTRACKING 



FIGS 
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HEURISTIC A 

BEGIN 

STEP1 

For each cluster do: 

For each edge that is part of the border of this cluster do: 

Divide the labeling space of that edge into 3 types of regions: 

a) The portions NOV of the labeling space that is free of overlaps. 

b) The portions 20V of the labeling space that overlaps the labeling 

space of exactly one other edge. 

c) The portions MOV of the labeling space that overlaps the labeling 

space of more than one other edges. 

STEP 2 

We build the overlap graph Gov(Vov.Eov) as follows: 
For each edge i in the original graph we introduce a node Vj in Gov 
For each cluster cl of the layout do: 

For each edge i and each k such that SubLS(i,k) nc\^0 do: 
For each portion NOV s in cl of i do: 

If s is enough to Include a number of discrete labels for i then 
Introduce a node Vs in Gov and an edge (Vj.Vs) in Eov 

Else 

If a 20V region of i is adjacent to s then 
Increase that region to include s. 

Else 

If a MOV region of I is adjacent to s then 
Increase that region to include s 

Else 

Remove s from RLS(i). 
For each portion 20V s in cl of i do: 

If s is enough to include a number of discrete labels for i then 
For each such label do: 

Introduce a node Vo in Gov and an edge(Vj, Vs) in Eov- 

Else 

If a MOV region of i Is adjoint to s then 
Increase that region to include s 

Else 

Remove s from RLS(i) 
For each portion MOV s in ci of i do: 

If s is enough to include a number of discrete labels for i then 
For each label do: 

introduce a node Vs in Gov and an edge (V|,Vs) in Eov 

Else 

Remove s from RLS(I) 

STEPS 

Find all the conflicts between the labels of STEP 2. 
If labels ii and I2 overlap then 

Add an edge (vii,V|2) between their corresponding to nodes in Gov 

END. 
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HEURISTIC B 

BEGIN 

STEP1 

We build the overlap graph Gov(Vov» Eov) as follows. 

For each edge I in G do: 

Introduce a node ve! in Gov 
For each edge i in G do: 

For each region SubLS(i,j) do: 

Assign a number of discrete labels to i. 
For each edge I do: 

For each label assigned to i do: 

Introduce a node vl in Gov and an edge (VEi,vu) in Eov 

STEP 2 

Find ail the conflicts between the labels of STEP 1 . 
If labels H and la overlap then . 

Add an edge (vm.vk) between their corresponding to nodes in G, 

END. 
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Reduction: Heuristic C 

BEGIN 

STEP1 

We build graph Gr as follows: 
Gr = Gov 

For eacli node v in Gr do: 

If node V corresponds to an edge of tlie original graph 
Remove node v from Gr and its incident edges. 

STEP 2 

We build Gm in the following way: 

For each edge 1 of the original graph do: 
Add a node vgj in VmE- 

STEPS 

For each edge I do: 

If there is only one node V| in Gr con-espondlng to a label of edge i then: 
Remove all nodes incident to m\. 

STEP 4 

For each node vli in Viabei do: 
If VLi has degree 0 then 

Add this node in VmL and add an edge in Em that connects this node 
to node uej in VmE- 
Remove vy from Gr. 
For each connected component cc in Gr do: 
If cc is a complete graph then 
Add a node v in VmL- 
For each node vy in cc do: 

Add an edge in Em that connects node v to node vei in VmE- 
Remove cc from Gr. 

STEPS 

For each connected component cc left in Gr do: 
For each node vy in cc do: 

If a node uei in VmE is connected to at least 2 nodes in VmL of degree 2 
that connect to the same node in VmE. then 
Remove node uei from cc 
Return to Step 3. 
Remove the node from cc with the highest degree. 
Return to Step 3. 

Randomly remove a node from cc. 
Return to Step 3. 
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Heuristic D 



If tlie minimum weight incident edge of a node in Ve connects tills node to 
a node in Vl of degree 1 thien: 

Assign this edge as a matched edge. 

Update G'. 

STEP 2 

If a node in Ve has degree 1 then 

Assign Its incident edge as a matched edge. 

Update graph G'. 
Repeat STEP 2 until no new edge has been matched. 
STEP 3 

G' consists of nodes in Ve that have degree 0 or degree at least 2. 
Delete those nodes from G' with degree 0. 

For each node in Ve we keep only the 2 edges of least weight and remove 
the rest of the edges from G'. 

Now each node in Ve has degree 2, and each node in Vl has degree at 
most 2. 

The remaining bipartite graph has an interesting structure. It is a simple cycle 
or a simple path. Any such graph has exactly 2 maximum cardinality matchings. 
Since the number of edges is even, we can find by just traversing the cycle 
or path and pick as part of the matching only the even or odd numbered 
edges. 

Choose the one matching of minimum weight. 



Note: 

The Update(G') operation removes the 2 nodes incident to a new matched edge 
and stores that edge and its incident nodes as part of the matching. Also removes 
all incident edges from the 2 nodes. 
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INPUT: 

A drawing G and a set of graphical features GF. 
BEGIN 
PHASE 1 
START LOOP 

STEP1 

Compute an overlap graph Gov for G 

(for each iteration use a different set of discrete labels to form the overlaping 
graph. In the first iteration use the maximum number of labels and for each 
consecutive iteration remove those labels that produce the most overlaps 
or labels that belong to graphical features that have a number of labels 
associated with them already in the overlaping graph.) 
STEP 2 

Reduce the overlap graph Gov to a matching graph Gmatching 
STEPS 

If specified further reduce the matching graph. 
STEP 4 

Run the matching algorithm on the matching graph. 
If the matching is complete 

Exit the program a labeling assignment found. 

Else 

If the size of the labels assigned in the matching is above a user-set 
threshold and(or) after a number of iterations 
Exit loop 

END LOOP 
PHASE 2 
STEP1 

Run Backtracking algorithm 
STEP 2 

If there are graphical features with no labels assigned 

Relaxe restrictions on overlaping of labels with any graphical feature 

of the drawing. 

Run Backtracking algorithm 

STEPS 

If there are graphical features with no labels assigned 

Force placement of labels or do not assign labels or assign labels 
as shown in Figure 6. 

END. 
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